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PRODUCTION OF RECOMBINANT MONELLIN USING 
\/»rTHVI OTROPHir- VF AST KXPRFSSION SYSTEM 

This application claims the benefit of priority under 35 U.S.C. § 1 19(e) to 
U.S. provisional application Serial No. 60/114,529 to Lingxun Duan, filed 
December 31, 1998, and entitled PRODUCTION OF RECOMBINANT 
MONELLIN USING METHYLOTROPHIC YEAST EXPRESSION SYSTEM. 

1. FIFT D OF T wy INVENTION 

The present invemion relates to a single-chain monellin-like protein which is 
stable and which is at least 100-fold sweet as compared to sucrose on the weight 
basis. The present invention also relates to a nucleic acid encoding said monellin- 
like protein. Preferably, the nucleic acid further comprises a promoter and a signal 
sequence for directing expression and secretion of the encoded monellin-like 
protein in the methylotrophic yeast Pichia pastoris. The presem invention further 
relates to a recombinant Pichia pastoris cell containing the nucleic acid encoding 
the monellin-like protein, a process for producing the monellin-like protein from 
the recombinant Pichia pastoris and product of the process. 



20 



2. RArKr.ROUND ART 
2.1. MONELLIN 

Monellin belongs to a family of intensely sweet proteins derived from 
tropical plants (Dansby, Nature Biotechnology, 1997, 15:419-420). Monellin is 
about 3,000-fold sweet as compared to sucrose. Other similar proteins include 

25 thaumatin, miraculin, mabinlin, pentadin and aspartame (Id.) Monellin was first 
isolated from the West African Plant Dioscoreophyllum comminisii (U.S. Patent 
Nos. 3.878,184 and 3,998,798; Morris and Cagan, Biochim. Biophys. Acta, 1972, 
261:114-122). The amino acid sequence, the three-dimensional structure and 
various physical and chemical properties of monellin have been characterized 

30 (Ogata, et al.. Nature, 1987, 328:739-742; Morris et al., J. Biol. Chem., 1973, 
248:534-539; Cagan, Science, 1973, 181:32-35; Bohak and Li, Biochim. Biophys. 
Acta, 1976, 422:153-170; Hudson and Beeman, Biochem. Biophys. Res. Comm., 
1976, 71:212-220; Van der Wei and Loeve, FEBS Lett., 1973, 29:181-183; and 
Frank and Zuber, HoppeSeylerS Z Physiol. Chem., 1976, 357:585-592). 
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U.S. Patent No. 4,300,576 discloses smoking articles containing thaumatin 
or monellin. U.S. Patent No. 4,562,076 discloses chewing gum with coating of 
thaumatin or monellin. U.S. Patent No. 4,412,984 discloses flavor potentiated oral 
compositions containing thaumatin or monellin. However, despite its potential as 
5 low-calorie sweeteners, wide commercial application of monellin is hampered by 
concerns over its poor stability to heat and pH, lack of access to sources of supply 
of the plant and uncertainty in the regulatory climate for food additives (Dansby, 
Nature Biotechnology, 1997, i5:41 9-420). 

In 1989, Sung-Hou Kim's group reported production of single-chain 
10 monellin in E. coli by genetic engineering (Kim et al.. Protein Eng., 1989, 2:571- 
575). The purified single-chain monellin was found to be more heat-stable and 
tolerant to a wide pH range, but retained the intensity of sweetness. Several 
aspects of this invention have been the subject of certain U.S. patents. For 
example, U.S. Patent No. 5,234,834 discloses constructs for expression of single- 
15 chain monellin in plant cells. U.S. Patent No. 5,487,923 discloses a sweet 

proteinaceous compound of the formula B-C-A, wherein B represents a peptide 
portion at least 90% homologous to residues 1-46 of the B chain of native monellin 
and modified only by conservative substitutions; C is a covalent bond or is a 
hydrophilic, physiologically acceptable covalent linker capable of providing a 
20 spacing length equivalent to a peptide of 1-10 amino acids selected so as to reside 
on the external portion of the molecule and not to disturb the native conformation; 
and A represents a peptide at least 90% homologous to residues 6-45 of the A 
chain of native monellin and modified only by conservative substitution. U.S. 
Patent No. 5,487,983 discloses an expression system for making the single-chain 
25 monellin disclosed in U.S. Patent No. 5,487,923. U.S. Patent No. 5,670,339 
discloses DNA encoding the single-chain monellin disclosed in U.S. Patent No. 
5,487,923. U.S. Patent No. 5,672,372 discloses methods for sweetening a food 
composition with the single-chain monellin disclosed in U.S. Patent No. 5.487,923. 
U.S. Patent No. 5,264,558 discloses a single-chain monellin protein that is, in a 
30 standard taste test, at least 50 times that of sucrose on a weight basis. 

Recently, Kondo et al.. Nature Biotechnology, 1997, 15:453-457 discloses 
heterologous expression of a single-chain monellin protein in the yeast Candida 
utilis intracellularly. It reports that monellin was produced at a high level, 
accounting for >50% of the soluble protein. 
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2.2. EXPRESSION OF HETEROLOGOUS PROTEINS 
IN PICHIA PASTORIS 

The methylotrophic yeast Pichia pastoris has been used as a protein 
5 expression system. Several aspects of this expression system have been the subject 
of certain U.S. patents. For example, U.S. Patent No. 4,837,148 discloses 
autonomous replication sequences for Pichia pastoris. U.S. Patent No. 4,855,231 
discloses regulatory region for heterologous gene expression in Pichia pastoris 
cells. U.S. Patent No. 4,882,279 discloses site selective genomic modification of 

10 Pichia pastoris. U.S. Patent No. 4,929,555 discloses a method for making whole 
cells of Pichia pastoris competent for transformation. U.S. Patent No. 5,122,465 
discloses a process for generating a selectable phenotype in strains of Pichia 
pastoris. U.S. Patent No. 5,324,639 discloses production of insulin-like growth 
factor- 1 in methylotrophic cells, including Pichia pastoris cells. 

15 A number of signal sequences have been used to direct secretion of 

heterologous proteins expressed in Pichia pastoris cells. Examples of such signal 
sequences include, but are not limited to, the signal sequence of Pichia pastoris 
acid phosphatase, the signal sequence of Aspergillus giganteus alpha-Sarcin 
(Martinez-Ruiz et al.. Protein Expr. Purif., 1998, 1201:315-22; Abdulaev et al., 

20 Protein Expr. Purif., 1997, 10(0 :61-9; Kotake et al., J. Lipid Res., 1996, 
37£3): 599-605), the signal sequence of alpha-N-Acetylgalactosaminidase 
(alphaNAGAL, EC 3.2.1.49) (Zhu et al.. Arch. Biochem. Biophys., 1998, 
352(l) :l-8), the signal peptide of the OmpA protein (Heim el al., Biochim. 
Biophys. Acta., 1998, 1396(3) :306-19), the signal sequence of the mouse 

25 alpha- factor signal (cCell) or the native signal sequence of pepper 

endo-beta-l,4-glucanases (Ferrarese et al., FEBS Lett., 1998, 422(1 ):23-6), signal 
peptide of laccase isolated from the ligninolytic fungus Trametes (Jonsson et al., 
Curr. Genet., 1997, 32(6) :425-30), signal peptide of murine lysosomal acid 
alpha-mannosidase (Merkle et al., Biochim. Biophys. Acta., 1997, 1336(2) : 132-46), 

30 signal peptide of the porcine inhibitor of carbonic anhydrase (Wuebbens et al.. 
Biochemistry, 1997, 36(14) :4327-36), signal sequence of Aspergillus awamori 
glucoamylase (Fierobe et al., Protein Expr. Purif., 1997, 9(2): 1 59-70), signal 
sequence of mouse major urinary protein (Ferrari et al., FEBS Lett., 1997, 
401(n :73-7), signal sequence of phol (Skory et al., Curr. Genet., 1996, 
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30£5}:4 17-22), signal sequence of rabbit angiotensin-converling enzyme (ACE) 
(Sadhukhan et al., J. BioL Chem., 1996, 27U3n :18310-3), prepeptide sequence of 
Pichia pastoris aspartic proteinase (Tsujikawa et al.. Yeast, 1996, 12(6):541-S3), 
signal sequence Pichia pastoris PRCl (Ohi et al.. Yeast, 1996, 12(1):31^40), the 
5 signal sequence of a bacterial thermostable alpha amylase and SUC2 gene signal 
sequence from Saccharomyces cerevisiae (Paifer et al.. Yeast, 1994, 10(1 1):1415-9) 
and the signal sequence of Saccharomyces cerevisiae mating pheromone a-factor 
(Fidler et al., 7. Moi EmiocrinoL, 1998, 21(3) :327-336). 

Although the methylotrophic yeast Pichia pastoris has been used 

10 successfully for the production of various heterologous proteins, U.S. Patent No. 
5,324,639 discloses that at the present level of understanding of methylotrophic 
yeast expression systems, it is unpredictable whether a given gene can be expressed 
to an appreciable level in such yeast or whether the yeast host will tolerate the 
presence of the recombinant gene product in its cells. U.S. Patent No. 5,324,639 

15 further discloses that it is especially difficult to foresee if a particular protein will 
be secreted by the methylotrophic yeast host, and if it is, at what efficiency. For 
example, Vollmer et al., J, ImmunoL Methods. 1996, 199(1 ) :47-54, reports that 
when the 323 amino acid residues of the human sIL-6R are inserted into an 
expression/secretion vector suitable for the methylotrophic yeast Pichia pastoris. no 

20 detectable expression and secretion of the recombinant protein was obtained. Up to 
date, monellin has not been expressed and secreted using the Pichia pastoris 
expression system. 

Given the great interest in the commercial application of monellin, there is a 
great need for a more efficient method for producing stable monellin which still 
25 retains its native sweet intensity and which simplify down stream purification 
procedures. The present invention addresses these and other needs in the art. 
Citation of references hereinabove shall not be construed as an admission that such 
references are prior art to the present invention. 

30 3. SUMMARY OF THE INVENTION 

The present invention relates to an isolated nucleic acid comprising a 
nucleotide sequence encoding a chimeric protein, said chimeric protein comprises, 
from N-terminus to C-terminus: a) a first peptidyl fragment consisting of an amino 
acid sequence that has at least 40% identity to residues 1-50 of the B chain of 
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native moneilin, in which the percentage identity is determined over an amino acid 
sequence of identical size to the B chain of native moneilin; b) a peptidyl bond, or 
a second peptidyl fragment consisting of 1-12 amino acids; and c) a third peptidyl 
fragment consisting of an amino acid sequence that has at least 40% identity to 
5 residues 1-45 of the A chain of native moneilin, in which the percentage identity is 
determined over an amino acid sequence of identical size to the A chain of native 
moneilin, wherein said chimeric protein is stable and a given amount of said 
chimeric protein is at least 100-fold sweet as compared to the identical amount of 
sucrose, and within said nucleic acid, codons which are preferably used by yeast 

10 cells are used. Preferably, the isolated nucleic acid further encodes a promoter 

which is capable of directing protein expression in Pichia pastoris and/or an amino 
acid sequence which is capable of directing secretion of the encoded chimeric 
protein from Pichia pastoris. 

The present invention also relates to a recombinant Pichia pastoris cell 

15 containing the above nucleic acids. The present invention further relates to a 

process for producing a chimeric protein comprising growing a recombinant Pichia 
pastoris cell containing the above nucleic acid such that the encoded chimeric 
protein is expressed and secreted by the cell, and recovering the expressed and 
secreted chimeric protein. Finally, the present invention relates to products of the 

20 above processes. 



4. BRIEF DESCRIPTION OF THE DRAWINGS 

FIG. 1 shows the amino acid sequence of a recombinant single-chain 
moneilin protein and the DNA sequence encoding the recombinant single-chain 
25 moneilin protein. Amino acid residues 1-50 corresponds to the amino acid residues 
1-50 of the B chain of native moneilin; amino acid residue 51 is Glycine as the 
linker; and amino acid residues 52-96 corresponds to the amino acid residues 1-45 
of the A chain of native moneilin. 

FIG. 2 shows the DNA sequence of the oligos which were used for 
30 synthesis of the recombinant single-chain moneilin protein. 

FIG. 3 shows the location of each of DNA oligo in the synthesized moneilin 
DNA and its enzymatic digestion sites. 

FIG.4 shows the restriction map of recombinant moneilin protein expression 
vector pGWYSl. 
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6 

FIG. 5 shows the construction of pGWYSl. 

FIG.6 shows the SDS-PAGE analysis of the secreted recombinant monellin 
protein isolated from the culture medium. 

FIG. 7 shows the steps for purifying the secreted recombinant monellin 

5 protein. 



5, DETAILED DESCRIPTION OF THE INVENTION 

The present invention provides a nucleic acid encoding a single-chain 
monellin-Iike protein which is stable and which is at least 100-fold sweet as 

10 compared to sucrose on the weight basis. Preferably, the nucleic acid further 

comprises a promoter and a signal sequence for directing expression and secretion 
of the encoded monellin-like protein in the methylotrophic yeast Pichia pastoris. 
The present invention also provides a recombinant Pichia pastoris cell containing 
the nucleic acid encoding the monellin-like protein, a process for producing the 

15 monellin-like protein from the recombinant Pichia pastoris and product of the 
process. 

For clarity of disclosure, and not by way of limitation, the detailed 
description of the invention is divided into the subsections which follow. 

20 5.1. NUCLEIC ACIDS ENCODING 

THE SINGLE-CHAIN MONELLIN PROTEINS 

The present invention provides an isolated nucleic acid comprising a 

nucleotide sequence encoding a chimeric protein, said chimeric protein comprises, 
from N-temiinus to C-terminus: a) a first peptidyl fragment consisting of an amino 

25 acid sequence that has at least 40% identity to residues 1-50 of the B chain of 

native monellin, in which the percentage identity is determined over an amino acid 
sequence of identical size to the B chain of native monellin; b) a peptidyl bond, or 
a second peptidyl fragment consisting of 1-12 amino acids; and c) a third peptidyl 
fragment consisting of an amino acid sequence that has at least 40% identity to 

30 residues 1-45 of the A chain of native monellin, in which the percentage identity is 
determined over an amino acid sequence of identical size to the A chain of native 
monellin, wherein said chimeric protein is stable and a given amount of said 
chimeric protein is at least 100- fold sweet as compared to the identical amount of 
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sucrose, and within said nucleic acid, codons^ which are preferably used by yeast 
cells are used. 

In a specific embodiment, the present invention provides an isolated nucleic 
acid comprising a nucleotide sequence encoding the chimeric protein wherein the 
5 first peptidyl fragment consists of an amino acid sequence that has at least 60% 
identity to the B chain of native monellin. Preferably, the first peptidyl fragment 
consists of an amino acid sequence that has at least 90% identity to the B chain of 
native monellin. More preferably, the first peptidyl fragment consists of the amino 
acid residues 1-50 of the B chain of native monellin. 

10 In another specific embodiment, the present invention provides an isolated 

nucleic acid comprising a nucleotide sequence encoding the chimeric protein 
wherein the second peptidyl fragment consists of the amino acid sequence Gly-Gly- 
Gly-Ser-GIy-Gly-Gly-Ser-Gly-Gly-GIy-Ser (SEQ ID NO:l). Preferably, the second 
peptidyl fragment consists of the amino acid sequence Gly-Gly-Gly-Ser (SEQ ID 

15 NO:2). More preferably, the second peptidyl fragment consists of amino acid 
residue Gly. 

In still another specific embodiment, the present invention provides an 
isolated nucleic acid comprising a nucleotide sequence encoding the chimeric 
protein wherein the third peptidyl fragment consists of an amino acid sequence that 

20 has at least 60% identity to the A chain of native monellin. Preferably, the third 
peptidyl fragment consists of an amino acid sequence that has at least 90% identity 
to the A chain of native monellin. More preferably, the third peptidyl fragment 
consists of the amino acid residues 1-45 of the A chain of native monellin. 

In a preferred embodiment, the present invention provides an isolated 

25 nucleic acid comprising a nucleotide sequence encoding the chimeric protein 

wherein the first peptidyl fragment consists of the amino acid residues 1-50 of the 
B chain of native monellin, the second peptidyl fragment consists of the amino acid 
residue Gly and the third peptidyl fragment consists of the amino acid residues 1- 
45 of the A chain of native monellin. 

30 In a specific embodiment, the present invention provides an isolated nucleic 

acid comprising a nucleotide sequence encoding the chimeric protein which is 
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capable of being immunoreactively bound by an anti-monellin or an anti-thaumatin 
antibody. 

In another specific embodiment, the present invention provides an isolated 
nucleic acid comprising a nucleotide sequence encoding the chimeric protein 
5 wherein the chimeric protein further comprises an amino acid sequence which is 
capable of directing secretion of said chimeric protein from Pichia pastoris. 
Preferably, the secretion-directing sequence is an endogenous signal sequence of 
Pichia pastoris. More preferably, the endogenous signal sequence is selected from 
the group consisting of the signal sequence of Pichia pastoris acid phosphatase, 
10 Pichia pastoris aspartic proteinase and Pichia pastoris carboxypeptidase Y encoded 
by Pichia pastoris PRCl. Alternatively, the secretion-directing sequence is a yeast 
signal sequence, wherein said yeast is not Pichia pastoris. Preferably, the yeast 
signal sequence is a signal sequence from Saccharomyces cerevisiae. More 
preferably, the Saccharomyces cerevisiae signal sequence is selected from the group 
15 consisting of the signal sequence of Saccharomyces cerevisiae SUC 2 and 
Saccharomyces cerevisiae mating pheromone a-factor. Most preferably, the 
Saccharomyces cerevisiae signal sequence is the signal sequence of 
Saccharomyces cerevisiae mating pheromone a-factor. Examples of other 
secretion-directing sequences that can be used in the present invention include, but 
20 are not limited to, the signal sequence of Aspergillus giganteiis alpha-Sarcin, 

alpha-N-Acetylgalactosaminidase, OmpA protein, the mouse alpha- factor (cCell), 
the pepper endo-beta-l,4-glucanases, the laccase isolated from the ligninolytic 
fungus Trametes^ murine lysosomal acid alpha-mannosidase, the porcine inhibitor 
of carbonic anhydrase, Aspergillus awamori glucoamylase, mouse major urinary 
25 protein, phol, rabbit angiotensin-converting enzyme (ACE), and the bacterial 
thermostable alpha amylase. 

In a specific embodiment, the present invention provides an isolated nucleic 
acid comprising a nucleotide sequence encoding the chimeric protein which nucleic 
acid is a DNA. In another specific embodiment, the present invention provides an 
30 isolated nucleic acid which is hybridizable to the DNA sequence encoding the 
chimeric protein. In still another specific embodiment, the present invention 
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provides an isolated nucleic acid comprising a nucleotide sequence complementary 
to the nucleotide sequence encoding the chimeric protein. 

In a specific embodiment, the present invention provides a DNA encoding 
the chimeric protein which DNA further comprises a promoter which is capable of 
5 directing protein expression in Pichia pastoris. Preferably, the promoter is an 
endogenous promoter of Pichia pastoris. More preferably, the endogenous 
promoter is the promoter of Pichia pastoris glyceraldehyde-3-phosphate 
dehydrogenase (GAP). Alternatively, although not preferred, promoters of 
methanol responsive genes in methylotrophic yeast can also be used. Examples of 

10 such methanol responsive promoters include, but are not limited to, the promoter 
for the primary alcohol oxidase gene from Pichia pastoris AOXl, the promoter for 
the secondary alcohol oxidase gene from Pichia pastoris AOX2, the promoter for 
the dihydroxyacetone synthase gene from Pichia pastoris (DAS), the promoter for 
the P40 gene from Pichia pastoris, the promoter for the catalase gene from Pichia 

15 pastoris, and the like (see U.S. Patent No. 5,324,639). 

In another specific embodiment, the present invention provides a DNA 
encoding the chimeric protein which DNA further includes sequences allowing for 
its replication and selection in bacteria. In this way, large quantities of the DNA 
fragment can be produced by replication in bacteria. 

20 In a preferred embodiment, the present invention provides a DNA encoding 

the chimeric protein, wherein within the encoded chimeric protein, the first peptidyl 
fragment consists of the amino acid residues 1-50 of the B chain of native 
monellin, the second peptidyl fragment consists of the amino acid residue Gly and 
the third peptidyl fragment consists of the amino acid residues 1-45 of the A chain 

25 of native monellin, and said DNA further comprises the promoter of Pichia 
pastoris GAP and the signal sequence of Saccharomyces cerevisiae mating 
pheromone a- factor. 

In another preferred embodiment, the present invention provides a DNA 
encoding the chimeric protein, wherein the codons which are preferably used by 

30 Pichia pastoris cells are used. 
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In a most preferred embodiment, the present invention provides a DNA 
encoding the chimeric protein wherein the DNA molecule comprises nucleotide 
sequence as depicted in Figure 1 or the DNA vector as depicted in Figure 4. 

The nucleic acid comprising a nucleotide sequence encoding the chimeric 
5 protein disclosed herein, or any fragments, analogues or derivatives thereof, can be 
obtained by any method(s) known in the art. The nucleic acid may be chemically 
synthesized entirely. Alternatively, the nucleic acid encoding each fragment of the 
chimeric protein, /.e., the first, second or third peptidyl fragment, may be obtained 
by molecular cloning or may be purified from the desired cells. The nucleic acid 
10 encoding each fragment of the chimeric protein may then be chemically or 

enzymatically ligated together to form the nucleic acid comprising a nucleotide 
sequence encoding the chimeric protein disclosed herein, or any fragments, 
analogues or derivatives thereof. 

Any Dioscoreophyllum comminisii cell potentially can serve as the nucleic 
15 acid source for the isolation of the nucleic acids encoding monellin. Alternatively, 
the nucleic acids encoding monellin can be designed and synthesized according to 
the amino acid sequence of the native monellin depicted in Figure 1 i^see also U.S. 
Patent No. 5,478,923). 

The DNA may be obtained by standard procedures known in the art from 
20 cloned DNA {e.g.^ a DNA "library"), by chemical synthesis, by cDNA cloning, or 
by the cloning of genomic DNA, or fragments thereof, purified from the desired 
cell {See^ for example, Sambrook et al., 1989, Molecular Cloning, A Laboratory 
Manual, 2d Ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New 
York; Glover, D.M. (ed.), 1985, DNA Cloning: A Practical Approach, MRL Press, 
25 Ltd., Oxford, U.K. Vol. I, IL) Clones derived from genomic DNA may contain 
regulatory and intron DNA regions in addition to coding regions; clones derived 
from cDNA will contain only exon sequences. Whatever the source, the gene 
should be molecularly cloned into a suitable vector for propagation of the gene. 

In the molecular cloning of the gene from cDNA, cDNA is generated from 
30 totally cellular RNA or mRNA by methods that are well known in the art. The 
gene may also be obtained from genomic DNA, where DNA fragments are 
generated (e.g., using restriction enzymes or by mechanical shearing), some of 
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which will encode the desired gene. The linear DNA fragments can then be 
separated according to size by standard techniques, including but not limited to, 
agarose and polyacrylamide gel electrophoresis and column chromatography. 

Once a nucleic acid comprising a nucleotide sequence encoding the chimeric 
5 protein disclosed herein, or any fragments, analogues or derivatives thereof, has 
been obtained, its identity can be confirmed by nucleic acid sequencing (by any 
method well known in the art) and comparison to the known sequences. DNA 
sequence analysis can be performed by any techniques known in the art, including 
but not limited to the method of Maxam and Gilbert (Maxam and Gilbert, 1980, 

10 Meth. Enzymol, 65:499-560), the Sanger dideoxy method (Sanger et al., 1977, 

Proc, Natl Acad Sci. U,S.A., 74:5463), the use of T7 DNA polymerase (Tabor and 
Richardson, U.S. Patent No. 4,795,699), use of an automated DNA sequenator 
{e.g.. Applied Biosystems, Foster City, CA) or the method described in PCT 
Publication WO 97/15690. 

15 Nucleic acids which are hybridizable to a nucleic acid comprising a 

nucleotide sequence encoding the chimeric protein disclosed herein, or any 
fragments, analogues or derivatives thereof, can be isolated, by nucleic acid 
hybridization under conditions of low, high, or moderate stringency {See also Shilo 
and Weinberg, 1981, Proc. Natl Acad, Set. USA, 78:6789-6792). 

20 As used herein, ''stable" means that a claimed single-chain monellin 

chimeric protein retains at least 70% of its sweet intensity after the protein has 
been placed at about 4X for at least 6 months, or at about 60°C for at least 2.5 
hours, or at about 100°C for at least 5 minutes. In addition, "stable" means that a 
claimed single-chain moneriin chimeric protein retains at least 70% of its sweet 

25 intensity after the protein has been placed at a pH raging from about 2.0 to about 
11.0 for at least 6 hours. 

Sweetness of the claimed single-chain monellin chimeric protein can be 
assessed using an ordinary taste test that is known in the art. For example, 
comparison to the sweetness of sucrose can be made by suitable dilutions on a 

30 weight basis {see also U.S. Patent No. 5,478,923). 

The preferred codon usage by yeast cells can be determined by methods 
known in the art, e.g,, methods disclosed in Sharp et al., Nucleic Acids Res., 1986, 
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i4Q3}:5125-43 and in Li and Luo, J, Theor, BioL^ 1996, 1 8U2) :1 1 1-24. 
According to Sharp, important characteristics of the preferred codon in yeast 
include a higher correlation with tRNA abundance, a greater degree of third base 
pyrimidine bias, and a lesser tendency to the A+T bas pairs. Li and Luo discloses 
5 a method of classifying and predicting the gene expression level in E. coli and 
yeast cells which is called the Self-Consistent Information Clustering (SCIC). 
Using the modified Codon Adaption Index (CAI) values, Li and Luo have 
accomplished the linear regression analysis on the relation between base 
composition, base correlation and gene expression level in Escherichia coli and 
10 yeast. Li and Luo also proposed the assumption of Expression-Enhancing-Network 
Site (EENS), the existence of which can be demonstrated by the linear equations 
between gene expression and base correlations in a codon, in adjacent codons and 
in non-adjacent codons. In addition, the codons that have been successfully used 
for expressing heterologous proteins in Pichici pastoris cells can be used. Examples 

15 of such codons can be found in U.S. Patent No. 4,837,148; U.S. Patent No. 

4,855,231; U.S. Patent No. 4,882,279; U.S. Patent No. 4,929,555; U.S. Patent No. 
5,122,465; U.S. Patent No, 5,324,639; Martinez-Ruiz et al.. Protein Expr. Purif, 
1998, 12(3) :3 15-22; Abdulaev et al.. Protein Expr. Purif,, 1997, 10fl) :61-9; 
Kotake et al., J. Lipid Res,, 1996, 37(3):599.605; Zhu et al„ Arch, Biochem. 

20 Biophys., 1998, 352(0 :1-8; Heim et al., Biochim. Biophys. Acta., 1998, 

1396f3) :3Q6-19; Ferrarese et al., FEBS Lett,, 1998. 422fn :23-6: Jonsson et al., 
Curr. Genet., 1997, 32(6) :425-30; Merkle et al., Biochim. Biophys. Acta., 1997, 
1336(2) : 132-46; Wuebbens et al.. Biochemistry, 1997, 36(14^ :4327-36; Fierobe et 
al.. Protein Expr, Purif., 1997, 9£2}: 159-70; Ferrari et al., FEBS Lett,, 1997, 

25 401(0 :73-7; Skory et al., Curr, Genet., 1996, 30(5) :4 17-22: Sadhukhan et al., J. 
Biol. Chem,, 1996, 271(30 :18310-3: Tsujikawa et al.. Yeast, 1996, 12(6) :541-53; 
Ohi et al., Yeast, 1996, 12(0 :31-40: Paifer et al.. Yeast, 1994, 10(10 :1415-9; 
Fidler et al., J. Mol. Endocrinol,, 1998, 21(3> :327-336: and Brocca et al.. Protein 
Sci„ 1998, 7£6}: 1415-22. 

30 Whether a chimeric protein is capable of being immunoreactively bound by 

an anti-monellin or an anti-thaumatin antibody can be determined by methods 
known in the art. The examples of anti-monellin or an anti-thaumatin antibodies 
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that can be used in the present invention include, but are not limited to, the 
antibodies disclosed in Slootstra et aL, Chem. Semises, 1995, 20f5) :535-43; 
Antonenko and Zanetti , Life ScL, 1994, 55(15) :! 187-92: Bodani et al., Hybridoma, 
1993, 12(2) : 177-83: Mandal et al., Hybhdoma, 1991, 10(4) :459-66 and Haimovich, 
5 Isr. J, Med, ScL, 1975, 1U11) :1183. 

5.2. PRODUCTION OF MONELLIN PROTEINS 
FROM RECOMBINANT PICHIA PASTORIS CELLS 

In a specific embodiment, the present invention provides a recombinant 

10 Pichici pastoris cell containing the nucleic acid which encodes a chimeric protein, 
said chimeric protein comprises, from N-terminus to C-terminus: a) a first peptidyl 
fragment consisting of an amino acid sequence that has at least 40% identity to 
residues 1-50 of the B chain of native monellin, in which the percentage identity is 
determined over an amino acid sequence of identical size to the B chain of native 

15 monellin; b) a peptidyl bond, or a second peptidyl fragment consisting of 1-12 

amino acids; and c) a third peptidyl fragment consisting of an amino acid sequence 
that has at least 40% identity to residues 1-45 of the A chain of native monellin, in 
which the percentage identity is determined over an amino acid sequence of 
identical size to the A chain of native monellin, wherein said chimeric protein is 

20 stable and a given amount of said chimeric protein is at least 100-fold sweet as 
compared to the identical amount of sucrose, and within said nucleic acid, codons 
which are preferably used by yeast cells are used. Preferably, the recombinant 
Pichia pastoris cell contains a DNA molecule comprises nucleotide sequence as 
depicted in Figure 1 or a DNA vector as depicted in Figure 4. Recombinant 

25 Pichia pastoris cells containing the nucleic acids disclosed in Section 4.1. are also 
provided. 

Methods for transforming methylotrophic yeast, such as Pichia pastoris, as 
well as methods applicable for culturing methylotrophic yeast cells containing in 
their genome a gene encoding a heterologous protein, are known generally in the 
30 art. Preferably, the transformation, positive transformant selection and culturing 
methods disclosed in U.S. Patent No. 4,837,148; U.S. Patent No. 4,855,231; U.S. 
Patent No. 4,882,279; U.S. Patent No. 4,929,555; U.S. Patent No. 5,122,465; U.S. 
Patent No. 5,324,639 can be used in the present invention. 
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In another specific embodiment, the present invention provides a process for 
producing a monellin chimeric protein comprising growing a recombinant Pichia _^ 
pastohs cell containing the nucleic acid disclosed in Section 4.1. such that the 
encoded chimeric protein is expressed and secreted by the cell, and recovering the 
5 expressed and secreted chimeric protein. Preferably, the recombinant Pichia 

pastoris cell containing a DNA molecule comprises nucleotide sequence as depicted 
in Figure I or a DNA vector as depicted in Figure 4 is used. 

Any suitable fermentation process in the art can be used in the present 
process. For large-scale production of recombinant DNA-based products driven by 
10 a GAP promoter in methylotrophic yeast such as Pichia pastoris^ a three-stage, 

high cell-density fed batch fermentation system is preferably employed. In the first 
or growth stage of this fermentation system, the expression host Pichia pastoris 
cells are cultured in defined minimal medium such as BMGY (Buffered Minimal 
Glycerol-complex medium) with an excess of a non-inducing carbon source {e.g., 
15 glycerol). When the expression host Pichia pastoris cells are grown on such 
carbon sources, heterologous gene expression is repressed, which allows the 
generation of cell mass in the absence of heterologous protein expression. During 
this growth stage, it is also preferred that the pH of the medium be maintained at 
about 5. Next, the expression host Pichia pastoris cells are grown on lin^ited non- 
20 inducing carbon source for a short period of time to further increase the cell mass 
and to depress the glucose responsive promoter. The pH of the medium during this 
limited growth period is kept below 4, preferably in the range from about 2,0 to 
about 3.5. The final stage is the production stage wherein either the "glucose 
excess fed-batch mode" or the "mixed-feed fed-batch mode" can be used. In the 
25 "glucose excess fed-batch mode," 2% glucose alone is added. In the "mixed-feed 
fed-batch mode," a limiting amount of a non-inducing carbon source and glucose is 
added in the fermentor to induce the expression of the monellin gene driven by a 
GAP promoter. 

The secreted monellin chimeric proteins can be recovered from the Pichia 
30 pastoris culture medium by any methods known in the art. For example, methods 
disclosed in U.S. Patent Nos. 3,878,184 and 3,998,798; Morris and Cagan, 
Biochim. Biophys. Acta, 1972, 261:114-122; Kim et al., Protein Eng., 1989, 2:571- 
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575; and Recently, Kondo et al.. Nature Biotechnology^ 1997, 15:453-457 can be 
used for recovering and isolating the secreted monellin chimeric proteins. 
Preferably, the expressed and secreted chimeric protein is recovered by a means 
comprising ion-exchange chromatography. More preferably, the expressed and 
5 secreted chimeric protein is recovered by a means comprising CM-Sephadex 
column chromatography or DEAE-Sephadex column chromatography. 

In another specific embodiment, the present invention provides the product 
of the above processes. 

10 6. EXAMPLE 

6.1. Preparation of the Synthetic Recombinant Monellin DNA 

The amino acid sequence of the recombinant monellin protein and the 
nucleotide sequence of the DNA encoding the recombinant monellin protein are 
shown in Figure 1. As shov^n in Figure 1, nucleotides 1-150 encode residues 1-50 
15 of the B chain of the native monellin protein; nucleotides 150-152 encode Glycine 
as the linking "L" portion; and nucleotides 153-287 encode residues 1-45 of the A 
chain of the native monellin protein. The recombinant monellin protein is preceded 
by the following amino acid sequence, which corresponds to a Met residue and the 
signal sequence of Saccharomyces cerevisiae mating pheromone a- factor: Met-Leu- 

20 Leu-Phe-Ile-Asn-Thr-Thr-Ile-Ala-Ser-IIe-Ala-Ala-Lys-Glu-Glu-Gly-Val-Ser-Leu- 
Glu-Lys-Arg-Glu-Ala-Glu-Ala-Glu-Phe (SEQ ID NO:3). 

This synthetic DNA encoding the signal sequence of Saccharomyces 
cerevisiae mating pheromone a-factor and the recombinant monellin protein was 
prepared from the oligos M1-M4 and N1-N4, which were synthesized using the 

25 Applied Biosystems 380B DNA Synthesizer by ACTG company {see Figures 2-3 
and 5). The oligos were isolated by urea-polyacrylamide gel electrophoresis and 
purified by passing through a Sep-pak CIS column (Whatman) and annealed and 
ligated as shown in Figure 3 to obtain the synthetic DNA bracketed by EcoRI sites. 
To synthesize the DNA encoding the signal sequence of Saccharomyces 

30 cerevisiae mating pheromone a-factor and the recombinant monellin protein, in 100 
ul PCR reaction volume, 2 pM of each of the oligo M2 to N3 were mixed with 10 
pM M 1 and N4, heated to 94°C for 5 minutes in the absence of Tag DNA 
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polymerase. The reaction mixture was then slowly cooled down to 37°C. After 1 
unit of the Vent DNA polymerase (New England Biolabs, Inc.) was added, the 
PCR reaction was performed according to the standard protocol. One hundred 
microliter PCR reaction mixture contains 50 mM Tris-HCl (pH 8.0), 2.5 mM 
5 MgCl i, 10 mM DTT, 1 mM dNTP, and 1 unit of the Vent DNA polymerase. The 
PCR reaction was performed as following: at 94°C for 1 minute, 53"C for 1,5 
minutes, 72°C for 2 minutes within each cycle; and for a total of 30 cycles. 
Finally, the reaction mixture was incubated at 72**C for 10 minutes. The reaction 
mixture was extracted by phenol/chloroform, precipitated with ethanol, and gel 

10 purified in 1,2% low-melting agarose gel. The purified DNA fragment was 
inserted into the pTTbleu (R) vector (Novagen, Inc.) to generate the pT7yM 
plasmid {see Figure 5). In 20 ul DNA ligation reaction, 2 ul of 10 mM ATP, 40 
units of the T4 DNA ligase (New England Biolab, Inc.) was added and mixed with 
1 ug purified monellin DNA fragment and 50 ng pT7bIue (R) vector. The reaction 

15 mixture was kept at 16°C for 16 hours. The ligation mixture was transformed into 
host cells by adding 5 ul of the ligation mixture to 200 ul E. coH NovaBlue 
competent cells (Messing, Methods in Enzymology\ 1983, 101:20-78) and the 
desired sequence was confirmed by dideoxy sequencing using T7 and U19 primer 
(Sanger el al., Proc, Natl. Acad. ScL, 1985, 74:5463-5467). 
20 (1) 

6.2, Preparation of the Expression Vector pGWYS-1 

The pGAPZa expression vector was purchased from Invitrogen, Inc. The 
synthetic monellin DNA fragment was removed from pT7yMenaIlin with EcoRI 
and inserted into an EcoRI site of the pGAPZa vector to give pGWYS. Briefly, 5 

25 ug purified pT7yMenallion plasmid was digested in 20 ul reaction volume using 5 
units EcoRI (Promega Inc.) at 37°C for 2 hours. After the reaction mixture was 
separated by 1% low-melting agarose gel electrophoresis, the synthetic monellin 
DNA fragment was purified using the Wizard PCR Preps DNA purification kit 
(Promega, Inc). One hundred ng purified monellin DNA fragment were used for 

30 ligation into the expression pGAPZa vector. In the 10 ul ligation reaction, 50 ng 
of the EcoRI digested pGAPZa vector was mixed with 100 ng purified monellin 
DNA fragment. The ligation reaction was carried out in the presence of 10 ul of 
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20 mM Tris-HCl (pH 7.5), 10 mM MgCl, 10 mM DTT and 200 units of the T4 
DNA ligase (New England Biolabs, Inc.) at 16°C overnight to give the pGWYS. 
The ligation mixture was transformed into E, coli TOPI OF' cells (Invitrogen, Inc.). 
Twenty Zeocin-resistant clones were picked and orientation of the insert was 
5 screened by PGR reaction using the a-factor ( 5'CTATTGCCAGCATTGCTGC3') 
(SEQ ID NO:4) and the N4 oligos. Each selected clone was transferred into 3 ml 
LB culture medium containing 200 ug/ml Zeocin and incubated with shaking at 
37'*C overnight. The recombinant plasmid pGWYS was prepared from 1.5 ml 
cultured bacteria cells using the Qiagen Tip 20 kit system (Qiagen, Inc.). Fifty ng 

10 purified pGWYS plasmid was used as the PGR template to determine orientation of 
the insert. In 25 ul PGR reaction, 50 ng of the pGWYS plasmid was mixed with 
2.5 pM of the a-factor and the N4 oligos in the presence of 1 unit Taq DNA 
polymerase (Promega, Inc.)- The PGR reaction was performed under the following 
conditions: at 94X for 1 minute, 55X. for 1 minute, ITC for 2 minutes within 

15 each cycle; and for a total of 40 cycles. Finally, the reaction mixture was 

incubated at 72"C. for 10 minutes. After the PGR reaction mixture was separated 
on 1.2% agarose gel, one of the clones which contains the insert with the desired 
orientation was named pGWYS-1. The sequence of the insert was further 
confirmed by DNA sequencing. 

20 

6,3. Transformation of Pichia pastoris Cells with the pGWYS-1 

To generate the high-level and stable expression of monellin in Pichia 
pastoris, purified pGWYS-1 plasmid was transformed into Pichia pastoris cells by 
electroporation technique described in the Pichia pastoris Expression Kit Manual 

25 using the Electroporation Apparatus II (Invitrogen Inc.). Briefly, 500 ml of the 
Pichia pastoris GSl 15 cells were grown in YPD medium at 2>(fC to an OD(,oo of 
1.3. Gells were pelleted with a centrifugation of 1,500 g for 5 minutes at 4X. 
Pelleted cells were washed with 500ml of ice-cold sterile water. The washing step 
was repeated with 250 ml and 20 ml ice-cold sterile water, receptively. Then, the 

30 cells were washed with 20 ml of ice-cold 1 M sorbitol and resuspend in 1 ml ice- 
cold sorbitol. Forty ul of the yeast GSl 15 cells in 1 M sorbitol were mixed with 
10 ug purified pGWYS-1 plasmid to total volume 50 ul and the mixture was 
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transferred into an ice-cold cuvette. The cuvette containing the mixture was 
incubated on ice for 5 minutes. Electroporation was performed according to the 
Electroporation Apparatus II manual parameters (Invitrogen Inc manufacture). 
After the electrical pulse, 1 ml of ice-cold IM sorbitol was added into the cuvette, 
5 and the content of the cuvette was transferred into a microcentrifuge tube. Two 
hundred ul transformed cells were plated on one 5 RDB plate containing 400 ug/ml 
Zeocin. The plates were incubated at 30*^0 until colonies appeared. Positive 
transformants were characterized by their growth in the presence of Zeocin at 
various concentrations, e.g., 400 ug/ml, 600 ug/ml, 800 ug/ml and lOOOug/ml. 

10 

6.4. Stability Test of the Positive Transformants 

Three positive transformants were selected for further characterization based 
on their growth in the presence of 800ug/ml Zeocin and the expression of 
recombinant monellin by the 2% glucose induction. The following experiment was 
15 performed to test their genetic stability. Each of these 3 positive transformants was 
picked up using a sterile toothpick and incubated on a YPD plate without any 
selection at 30°C until colonies appeared. The colonies were picked up and plated 
on a new YPD plate until new colonies appeared. After such non-selective growth 
was repeated 50 times, each of the passage colonies was incubated on a selective 
20 plate containing 800 ug/ml Zeocin. The protein expression upon 2% glucose 

induction was analyzed by SDS-PAGE. All three positive transformants showed 
the same phenotype as the original colonies after 50 times passage on the YPD 
plates. 

6.5 Production of the Recombinant Monellin Protein 

25 Each of the three positive transformants selected in 5,4. was grown inl liter 

YPD medium in 5 liter flask at 30°C with vigorous shaking (250rpm). Two ml 
supernatant were obtained from the culture after 24 hours, 48 hours and 72 hours, 
respectively. Five ul of the samples collected at each time point were analyzed 
using the 15-20% gradient polyacrylamide gel. The secreted recombinant monellin 

30 protein was observed as the 12 kD protein band. Quantitation of the SDS-PAGE 
analysis using the Densitometer (Molecular Dynamic, Inc.) indicates that one of the 
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positive strain produced nearly 10 grams per liter secreted recombinant single-chain 
monellin protein. This strain was named GWySl. 

6.6 Purification of Secreted Recombinant Monellin Protein 

5 Protein methods were used to purify the recombinant monellin protein 

secreted from GwyS-1 yeast strain. According to the first method, after 72-hour 
culturing, supernatant was collected by a centrifugation at 12,000 rpm (17,000g). 
After collection, the supernatant pH was adjusted to about 6.8 using 0.1 N NaOH 
solution. One M NaH,P04-Na,HP0, (pH6.8) was added into the supernatant till 

10 1:100 (v/v) and mixed well. The supernatant was then loaded on the CM-Sephadex 
column (Phamacia, Inc.) pre-equilibrated with 0.01 M NaH,POa-Na:HPO, (pH6.8) 
solution. After the column was washed with 5 column volume 0.01 M NaH.POj- 
Na^HPO^ (pH6.8) solution, the recombinant monellin protein was eluted with 0.3 M 
NaCl-0.01 M NaH,PO,-Na,HPO, (pH6.8) solution. After dialysis against water, the 

15 purity of the protein was determined to be about 98% by gel electrophoresis. 

According to the second method, after 72-hour culturing, supernatant was 
collected by a centrifugation at 12,000 rpm (17,000g). After collection, the 
supernatant pH was adjusted to about 7.2 using 0.1 N NaOH solution. One M 
NaCl-1 M NaH.PO.-Na.HPO^ (pH 7.2) was added into the supernatant till 1:100 

20 (v/v) and mixed well. The supernatant was then loaded on the DEAE-Sephadex 

column (Phamacia, Inc.) pre-equilibrated with 1 M NaH,P0,-Na,HP04 (pH 7.2)- IM 
NaCl solution. The flow-through fraction was collected and dialyzed against water. 
The purity of the protein was determined to be about 98% by gel electrophoresis. 
The recombinant monellin protein purified according to either method was 

25 further lyophilized to dry powder for testing its sweetness. 

6.7. Sweetness and Stability Test 

Sweetness of the purified recombinant monellin protein was assessed using 
an ordinary taste test. Comparison to the sweetness of sucrose was made by 
30 suitable dilutions on a weight basis. In a typical test, 1, 10, 25 and 50 mg/ml 

aqueous sucrose solutions were used as standard solutions. The minimum weight of 
the purified recombinant monellin protein which could generate sweet taste was 
compared with that of sucrose. The recombinant monellin of this invention 
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requires the addition of an amount which is about 1000- fold less than that of 
sucrose. For example, 50 ng/ml recombinant monellin protein solution was as 
sweet as 50 mg/ml sucrose (Lucky Supermarket's Lady Lee brand sugar). 

Stability was measured by dissolving natural monellin (Sigma, Inc.) and the 
5 purified recombinant monellin protein at 100 ug/ml concentration at pH 2.0, 4.0, 
6.3. and 7.5. Each sample was heated to 37*^0, 50°C, 60°C, 70"C, 80°C, 90°C and 
100°C for 15 minutes and let cool to room temperature before tasting. The most 
dramatic difference was that natural monellin lost its sweetness when heated to 
50°C at pH 2.0, while the purified recombinant monellin protein retained its 
10 sweetness even after heating at lOO^C for 5 minutes. 



The present invention is not to be limited in scope by the microorganism 
deposited or the specific embodiments described herein. Indeed, various 
15 modifications of the invention in addition to those described herein will become 
apparent to those skilled in the art from the foregoing description and 
accompanying figures. Such modifications are intended to fall within the scope of 
the appended claims. 

Various references are cited herein, the disclosures of which are 
20 incorporated by reference in their entireties. 
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WHAT IS CLAIMED IS: 

1. An isolated nucleic acid comprising a nucleotide sequence 
encoding a chimeric protein, said chimeric protein comprises, from N-terminus to 
5 C-terminus: 

a) a first peptidyl fragment consisting of an amino acid sequence that 
has at least 40% identity to residues 1-50 of the B chain of native 
monellin, in which the percentage identity is determined over an 
amino acid sequence of identical size to the B chain of native 

10 monellin; 

b) a peptidyl bond, or a second peptidyl fragment consisting of 1-12 
amino acids; and 

c) a third peptidyl fragment consisting of an amino acid sequence that 
has at least 40% identity to residues 1-45 of the A chain of native 

15 monellin, in which the percentage identity is determined over an 

amino acid sequence of identical size to the A chain of native 
monellin, 

wherein said chimeric protein is stable and a given amount of said chimeric protein 
is at least 100- fold sweet as compared to the identical amount of sucrose, and 
20 within said nucleic acid, codons which are preferably used by yeast cells are used. 

2. The isolated nucleic acid of claim 1, wherein the first 
peptidyl fragment consists of an amino acid sequence that has at least 60% identity 
to the B chain of native monellin. 

3. The isolated nucleic acid of claim 1, wherein the first 

25 peptidyl fragment consists of an amino acid sequence that has at least 90% identity 
to the B chain of native monellin. 

4. The isolated nucleic acid of claim 1, wherein the first 
peptidyl fragment consists of the amino acid residues 1-50 of the B chain of native 
monellin depicted as the amino acid residues 1-50 in Figure 1 (SEQ ID NO:5). 

30 5. The isolated nucleic acid of claim 1, wherein the second 

peptidyl fragment consists of the amino acid residue Gly. 
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6. The isolated nucleic acid of claim 1, wherein the second 
peptidyl fragment consists of the amino acid sequence Gly-Gly-Gly-Ser (SEQ ID- 
NO:2). 

7. The isolated nucleic acid of claim 1, wherein the second 

5 peptidyl fragment consists of the amino acid sequence Gly-Gly-Gly-Ser-Gly-Gly- 
Gly-Ser-Gly-Gly-Gly-Ser (SEQ ID NO:l). 

8. The isolated nucleic acid of claim 1, wherein the third 
peptidyl fragment consists of an amino acid sequence that has at least 60% identity 
to the A chain of native monellin. 

10 9. The isolated nucleic acid of claim 1, wherein the third 

peptidyl fragment consists of an amino acid sequence that has at least 90% identity 
to the A chain of native monellin. 

10. The isolated nucleic acid of claim 1, wherein the third 
peptidyl fragment consists of the amino acid residues 1-45 of the A chain of native 

15 monellin depicted as the amino acid residues 52-96 in Figure 1 (SEQ ID NO:5). 

1 1 . The isolated nucleic acid of claim 1 which nucleic acid 
encodes the amino acid residues 1-96 of Figure 1 (SEQ ID NO:5). 

12. The isolated nucleic acid of claim 1, wherein the chimeric 
protein is capable of being immunoreactively bound by an anti-monellin antibody. 

20 13. The isolated nucleic acid of claim 1, wherein the chimeric 

protein is capable of being immunoreactively bound by an anti-thaumatin antibody. 

14. The isolated nucleic acid of claim 1, wherein the chimeric 
protein further comprises an amino acid sequence which is capable of directing 
secretion of said chimeric protein from Pichia pasioris, 

25 15. The isolated nucleic acid of claim 14, wherein the secretion- 

directing sequence is an endogenous signal sequence of Pichia pastoris. 

16. The isolated nucleic acid of claim 15, wherein the 
endogenous signal sequence is selected from the group consisting of the signal 
sequence of Pichia pastoris acid phosphatase, Pichia pastoris aspartic proteinase 

30 and Pichia pastoris carboxypeptidase Y encoded by Pichia pastoris PRCl. 
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17. The isolated nucleic acid of claim 14, wherein the secretion- 
directing sequence is a yeast signal sequence, wherein said yeast is not Pichia 
pastoris. 

18. The isolated nucleic acid of claim 17, wherein the yeast 
5 signal sequence is a signal sequence from Saccharomyces cerevisiae, 

19. The isolated nucleic acid of claim 18, wherein the 
Saccharomyces cerevisiae signal sequence is selected from the group consisting of 
the signal sequence of Saccharomyces cerevisiae SUC 2 and Saccharomyces 
cerevisiae mating pheromone a-factor. 

10 20. The isolated nucleic acid of claim 19, wherein the 

Saccharomyces cerevisiae signal sequence is the signal sequence of Saccharomyces 
cerevisiae mating pheromone a-factor. 

21. The isolated nucleic acid of claim 1 1, further comprising an 
amino acid sequence which is capable of directing secretion of said chimeric 

15 protein from Pichia pastoris, 

22. The isolated nucleic acid of claim 21, wherein the secretion- 
directing sequence is the signal sequence of Saccharomyces cerevisiae mating 
pheromone a-factor, 

23. The isolated nucleic acid of claim 14, wherein the secretion- 
20 directing sequence is selected from the group consisting of the signal sequence of 

Aspergillus giganteus alpha-Sarcin, alpha-N-Acetylgalactosaminidase, OmpA 
protein, the mouse alpha-factor (cCell), the pepper endo-beta-l,4-glucanases, the 
laccase isolated from the ligninolytic fungus Trametes^ murine lysosomal acid 
alpha-mannosidase, the porcine inhibitor of carbonic anhydrase, Aspergillus 
25 awamori glucoamylase, mouse major urinary protein, phol, rabbit 

angiotensin-converting enzyme (ACE), and the bacterial thermostable alpha 
amylase. 

24. The nucleic acid of claim 1, wherein said nucleic acid is a 

DNA. 

30 25. An isolated nucleic acid comprising a nucleotide sequence 

complementary to the nucleotide sequence of claim 1. 
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26. An isolated nucleic acid hybridizable to the DNA sequence of 

claim 24. 

27. The DNA of claim 24, further comprising a promoter which 
is capable of directing protein expression in Pichia pastoris, 

5 28. The DNA of claim 27, wherein the promoter is an 

endogenous promoter of Pichia pastoris, 

29. The DNA of claim 28, wherein the endogenous promoter is 
the promoter of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase. 

30. The DNA of claim 24 said DNA encodes the amino acid 
10 residues 1-96 of Figure 1 (SEQ ID NO:5) and said DNA further comprises the 

promoter of Pichia pastoris glyceraldehyde-3-phosphate dehydrogenase and the 
signal sequence of Saccharomyces cerevisiae mating pheromone a-factor. 

31. The DNA of claim 24, wherein the codons which are 
preferably used by Pichia pastoris cells are used. 

15 32. A DNA molecule comprises nucleotide sequence as depicted 

in Figure 1 . 

33. The pGWYSl DNA vector as depicted in Figure 4. 

34. A recombinant Pichia pastoris cell containing the nucleic 
acid of claim 1 . 

20 35. A recombinant Pichia pastoris cell containing the DNA of 

claim 32. 

36. A recombinant Pichia pastoris cell containing the DNA of 

claim 33. 

37. A process for producing a chimeric protein comprising 

25 growing a recombinant Pichia pastoris cell containing the nucleic acid of claim 1 
such that the encoded chimeric protein is expressed and secreted by the cell, and 
recovering the expressed and secreted chimeric protein. 

38. A process for producing a chimeric protein comprising 
growing a recombinant Pichia pastoris cell containing the DNA of claim 32 such 

30 that the encoded chimeric protein is expressed and secreted by the cell, and 
recovering the expressed and secreted chimeric protein. 
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39. A process for producing a chimeric protein comprising 
growing a recombinant Pichia pastoris cell containing the DNA of claim 33 such- 
that the encoded chimeric protein is expressed and secreted by the cell, and 
recovering the expressed and secreted chimeric protein. 
5 40. The process of claim 37, wherein the expressed and secreted 

chimeric protein is recovered by a means comprising ion-exchange 
chromatography. 

41. The process of claim 40, wherein the ion-exchange 
chromatography being used is CM-Sephadex column chromatography. 
10 42. The process of claim 40, wherein the ion-exchange 

chromatography being used is DEAE-Sephadex column chromatography. 

43. The product of the process of claim 37. 

44. The product of the process of claim 38. 

45. The product of the process of claim 39. 
15 46. The product of the process of claim 40. 

47. The product of the process of claim 41. 

48. The product of the process of claim 42. 

49. A chimeric protein, said chimeric protein comprises, from N- 
terminus to C-terminus: 

20 a) a first peptidyl fragment consisting of an amino acid sequence that has at 

least 40% identity to residues 1-50 of the B chain of native monellin, in which the 
percentage identity is determined over an amino acid sequence of identical size to 
the B chain of native monellin; 

b) a peptidyl bond, or a second peptidyl fragment consisting of 1-12 amino 
25 acids; and 

c) a third peptidyl fragment consisting of an amino acid sequence that has at 
least 40% identity to residues 1-45 of the A chain of native monellin, in which the 
percentage identity is determined over an amino acid sequence of identical size to 
the A chain of native monellin, 

30 wherein said chimeric protein is stable and a given amount of said chimeric protein 
is at least 100-fold sweet as compared to the identical amount of sucrose, and 
within said nucleic acid, codons which are preferably used by yeast cells are used. 
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GGT GAG TGG GAG ATT ATT GAG ATT GGT CCA TTC ACT 
Gly Gly Trp GIu He He Asp He Gly Pro Phe Thr 

CAA AAC TTG GGT AAG TTC GCT GTT GAC GAG GAG AAC 
Gin Asn Leu Gly Lys Phe Ala Val Asp Glu Glu Asn 

AAG ATT GGT CAA TAG GGT AGA TTG ACT TTC AAC AAG 
Lys He Gly Gin Tyr Gly Arg Leu Thr Phe Asn Lys 

GTT ATT AGA CCA TGT ATG AAG AAG ACT ATT TAC GAG 
Val He Arg Pro Cys Met Lys Lys Thr He Tyr Glu 

AAC GAG GGT TCT AGA GAG ATT AAG GGT TAC GAG TAC 
Asn Glu Gly Ser Arg Glu He Lys Gly Tyr Glu Tyr 

CAA TTG TAC GTT TAC GCT TCT GAC AAG TTG TTC CGT 
Gin Leu Tyr Val Tyr Ala Ser Asp Lys Leu Phe Arg 

GCT GAC ATT TCT GAG GAC TAC AAG ACT CGT GGT CGT 
Ala Asp He Ser Glu Asp Tyr Lys Thr Arg Gly Arg 

AAG TTG TTG AGA TTC AAC GGT CCA GTT CCA CCA CCA 
Lys Leu Leu Arg Phe Asn Gly Pro Val Pro Pro Pro 

TAA 
Stop 



FIG.l 
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Ml 

5' AG A ATT CGG TGA GTG GGA GAT TAT TGA CAT TGG TCC ATT 
CAC TCA AAA CTT GG 3' 

M2 

5' GAA CAA GAT TGG TCA ATA CGG TAG ATT GAC TTT CAA CAA 
GTT TAT TAG GCC ATG T 3' 

M3 

5' GAG ACC GAG GGT TCT AGA GAG ATT AAG GGT TAC GAG TAC 
CAA TTG TAC GTT TAC GCT TC 3' 

M4 

5' GTG CTG ACA TTC CTG AGG ACT ACA AGA CTC GTG GTC GTA 
AGT TGT TGA GAT TC 3' 

Nl 

5' GTA TTG ACC AAT CTT GTT CTC CTC GTC AAC AGC GAA CTT 
ACC CAA GTT TTG AGT GAA TG 3' 

N2 

5' CTC TAG AAC CCTCGT TCT CGT AAA TAG TCT TCT TCA TAC 
ATG GTC TAA TAA CCT TG 3 ' 

N3 

5' GTC CTC AGA AAT GTC AGC ACG GAA CAA CTT GTC AGA AGC 
GTA AAC GTA CAA TTG 

N4 

5' AGA ATT CTT ATG GTG GTG GAA CTG GAC CGT TGA ATC TCA 
ACA ACT TAC GAC 3' 



FIG. 2 
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Ml M2 M3 M4 

m^^mm mm^mmm^ EcoRI 



EcoRI 



Nl N2 N3 N4 



FIG. 3 
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Comments for pGWYSI (3479 bp) 

GAP Promoter region: 1 -483 
Alpha-factor signal sequence: 493-760 
Monellin coding region: 762-1059 
3' AOX 1 termination region: 1060-1306 
TEF1 Promoter region: 1307-1709 
EM7 Promoter region: 1710-1781 
Sh ble ORF: 1782-2518 
CYC1 termination region: 2159 2477 
ColEI origin ( pUC-derived): 2478-3479 



FIG. 4 
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i 



PGR amplification with 
mived M and N oligos 




pT7 cloning vector 



_^ ligation 



Recombinant single chain 
Monellin gene 




i 



EcoR I digestion 



Monellin gene EcoRI fragment 



GAP 




EcoRI 



GAP 



EcoR I digestion Ligation 



pGAPZa 




EcoRI 



EcoRI 



pGWYSl 



FIG. 5 
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JL)|2J13JH 



Lane 1 . Protein MW Marker 




Lane 2. 5ul Culture Medium 

Lane 3. Partially Purified Recombinant 
Single Chain Monellin 

Lane 4. 40ug Native Monellin 



FIG.6 
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Recombinant Yeast Clone 



^ Culture Time: 48 hours Medium: YPD 

Temperature: 30 °C Shaking Speed: 150 rpm 

▼ 



20 ml Culture 



Culture Jime:2 4 hours Medium: YPD 
Temperature: 30 °C Shaking Speed: 150 rpm 



I Liter Culture 



Culture Time: 12 hours 


Medium: YPD 


Temperature: 28 - 30 °C 




Shaking Speed: 300 rpm 


pH 6.4 



20 Liter Culture 





Culture Time: 12 hours 


Medium: M5 


< 


Temperature: 28 - 30 °C 
Shaking Speed: 300 rpm 






pH 6.4 



400 Liter Culture 



Recovery Supernatant By 
Centrifugation 
12,000 g. 10 mimutes 



FIG. 7a 
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Recovery Culture Supernatant 
By Centrifugation 
12,000 g 10 mimutes 



I 



Method 1 



Adjust Supernatant pH to 6.8 
Add 1/100 Vol IM IM NaCl 



i 



Load to CM-Sephadex column 



Elute Protein Sample With 
0.01 M NaHjPO^-NajHPO^ 
( pH6.8) 0.3 N NaCl 



i 



Method 2 



i 



Adjust Supernatant pH to 7.2 
Add 1/100 Vol 
IM NaH^PO^-Na^HPO^ 



Load to DEAE-Cellulose 
DE52 column 



I 



Collecting Protein Sample 



i 



Desalt 



Concentrating Protein Sample 
0.3KD-0.6KD MW Cut Off 



FIG. 7b 
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the international application as filed has been furnished. 

□ The statement that the information recorded in computer readable form is identical to the written sequence 
listing has been furnished. 

4. The amendments have resulted in the cancellation of: 



Claims, No.: 



1-49 



as originally filed 



Drawings, sheets: 



1/8-8/8 



as originally filed 
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□ the description, pages: 

□ the claims, Nos.: 

□ the drawings, sheets: 

5. □ This report has been established as if (some of) the amendments had not been made, since they have been 

considered to go beyond the disclosure as filed (Rule 70.2(c)): 

(Any replacement sheet containing sucfi amendments must be referred to under item 1 and annexed to tliis 
report.) 

6. Additional observations, if necessary: 



V. Reasoned statement under Article 35(2) with regard to novelty, inventive step or industrial applicability; 
citations and explanations supporting such statement 

1. Statement 



Novelty (N) 


Yes: 


Claims 


6, 7, 11, 14-23, 27-48 




No: 


Claims 


1-5, 8-10, 12, 13, 24-26, 49 


Inventive step (IS) 


Yes: 


Claims 


6, 7, 11, 32,33 




No: 


Claims 


1-5. 8-10, 12-31, 34-49 


Industrial applicability (lA) 


Yes: 


Claims 


1-49 




No: 


Claims 





2. Citations and explanations 
see separate sheet 

Vil. Certain defects in the international application 

The following defects in the form or contents of the international application have been noted: 
see separate sheet 

Vill. Certain observations on the international application 

The following observations on the clarity of the claims, description, and drawings or on the question whether the 
claims are fully supported by the description, are made: 
see separate sheet 
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Re Item I 

Basis of the report 

1 . This report was established based on the application docunnents and the written 
sequence listing pages 1-5. 



Re Item V 

Reasoned statement under Article 35(2) with regard to novelty, inventive step or 
industrial applicability; citations and explanations supporting such statement 

1 . Reference is made to the following documents: 

D1 : KONDO K ET AL: 'High-level expression of a sweet protein, monellin, in the 
food yeast Candida utilis" NATURE BIOTECHNOLOGY, vol. 15. no. 5, May 
1997 (1997-05), pages 453-457, XP000910200 cited in the application 

D2: VEDVICK TS: 'Gene expression in yeast: Pichia pastoris' CURR. OPIN. 
BIOTECHNOL, vol. 2, no. 5, October 1991 (1991-10), pages 742-745, 
XP0009 15828 

D3: CREGG ET AL: 'Recent advances in the expression of foreign genes in 
Pichia pastoris' BIO/TECHNOLOGY, vol. 11. 1993, pages 905-910, 
XP002 100307 

D4: US-A-3 998 798 (UNIVERSITY OF PENSYLVANIA) 21 December 1976 

(1 976-1 2-21 ) cited in the application 
D5: WO 88 1 0265 A (UNIVERSITY OF CALIFORNIA) 29 December 1 988 (1 988- 

12-29) cited in the application 
D6: EP-A-0 821 055 (INT FLAVORS & FRAGRANCES INC) 28 January 1998 

(1998-01-28) 



2. Novelty (Article 33(2) PCT) 
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The present application discloses the production of recombinant single-chain 
chimeric monellin in the methylotropic yeast Pichia pastoris. 

2.1 . D1 (cited in the application) discloses the heterologous expression of a 
single-chain monellin gene in the yeast Candida utilis under the control of a C. 
utilis promoter. The single-chain monellin (Fig. 2B) is of the formula B-C-A, 
wherein C is a linker consisting of Gly. D1 is thus detrimental to the novelty and 
inventive step of claims 1-5, 8-10, 12, 13, 24-26 and 49. 

2.2. D5 (cited in the application) discloses single chain monellin analogues and their 
recombinant production. The analogues have the formula B-C-A. C is a covalent 
linker 3-10 amino acids long. Limited modifications of the A and B chain portions 
can be included. The compound with a reported sweetness at least 50 times that 
of sucrose has 96.9% identity to SEQ ID No:5. D5 is thus detrimental to the 
novelty and inventive step of claims 1-4, 8-10, 12, 13, 24-26 and 49. 

2.3. For the sake of completion, it is noted that D6 also discloses single chain monellin 
analogues of the formula B-C-A. C is a linker that can be Gly as in the present 
application (see Fig. 3-B(i) and (ii)). Sequence Id 4 has 96.9% Identity with the 
sequence of the present application while Sequence Id 7 has 97.9% identity. D6 is 
thus detrimental to the novelty and inventive step of claims 1-4, 8-10, 12, 13, 24- 
26 and 49. 



3. Inventive step (Article 33(3) PCT) 

3.1 . D2 and D3 disclose the versatility and utility of the Pichia pastoris expression 
system. Together with the teachings of D1 on recombinant expression of the 
single-chain monellin, they render the subject-matter of claims 14-23, 27-31 and 
34-48 obvious. 



3.2. D4 (cited in the application) discloses the purification of monellin from the crude 
fresh fruit of D. cumminsii by DEAE-cellulose and CM-cellulose ion exchange 
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chromatography. This teaching deprives inventive step from claims 40-42. 



4. Industrial applicability (Article 33(4) PCX) 

The subject-matter of present claims 1-49 appears to be industrially applicable 
under the terms of Article 33(4) PCT. 



Re Item Vil 

Certain defects in the international application 

1 . Contrary to the requirements of Rule 5.1 (ii) PCT, documents D2, D3 and D6 are 
not identified in the description and the relevant background art disclosed therein 
is not briefly discussed. 

2. Claim 32 relies on a reference to a figure, contravening Rule 6.2(a) PCT. 



Re Item VIII 

Certain observations on the international application 

1 . Claim 1 is related to a nucleic acid wherein "codons which are preferably used by 
yeast cells are used". This definition is not a true technical feature of the claimed 
compound and renders said claim not-enabled and unclear under the terms of 
Articles 5 and 6 PCT respectively. 



2. Claim 14 does not meet the requirements of Article 6 PCT in that the matter for 
which protection is sought is not clearly defined. The claim attempts to define the 
subject-matter in tenns of the result to be achieved (secretion of said protein from 
Pichia pastoris) which merely amounts to a statement of the underlying problem. 
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The technical features necessary for achieving this result should be added. 



3. Claim 26 relates to a nucleic acid "hybridisable to the DNA sequence of claim 24". 
Such a definition is too vague and lacks clarity in the sense of Article 6 PCT. It is 
furthermore noted that such a broad term jeopardises the novelty of the claim. 
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